TEXTURE DESCRIPTION METHOD AND TEXTURE-BASED IMAGE RETRIEVAL 
METHOD USING GABOR FILTER IN FREQUENCY DOMAIN 



BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to a texture description method of an image, 
and more particularly to a texture description method for transforming an image of a 
time domain into an image of a frequency domain and extracting texture features by 
Gabor filtering. Also, the present invention relates to a texture-based method of 
retrieving images indexed by the texture description method. 

2. Description of the Related Art 

Texture information and its application as an indication of important visual 
features of an image, have been studied for a long time. The texture information of 
an image is used as a low level descriptor for content-based indexing and 
abstracting an image or video data. Also, the texture information of the image is 
important in retrieving a specific photo of a digital photo album, or content-based 
retrieving in a tile or a textile database. 

Presently, feature values are calculated in a time domain or a frequency 
domain in order to extract texture features of an image. In particular, a method of 
texture feature extraction in a frequency domain is known to be suitable for 
describing the texture features of images of a wide variety of forms. 

A thesis on this method, entitled "Texture Features of Browsing and Retrieval 
of Image Data", by B. S. Manjunath and W. Y. Ma, published on IEEE Transaction 
on Pattern Analysis and Machine Intelligence, Volume 18, No. 8, on August 1996, 
describes a method for calculating feature vectors by extracting from the image 
obtained after Gabor filtering in the frequency domain, the mean and the variance of 
each channel as feature values of the texture of an image. 

However, the image texture description method using the conventional Gabor 
filtering has problems. First, it takes a long time for calculation by performing the 
Gabor filtering of an image in a signal domain. Second, it is difficult to obtain 
enough information because the density of frequency samples of an image is low in 
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the case where the texture information is extracted using the Gabor filter having a 
narrow pass band in a low frequency domain due to the use of an orthogonal 
frequency domain. Third, the size of data needed to describe features is great 
because both the mean and variance of an image brightness value are used as the 
texture features of the image. 

SUMMARY OF THE INVENTION 

To solve the above problems, it is an objective of the present invention to 
provide a texture description method and a texture-based image retrieval method in 
which the Gabor filtering is done in the frequency domain rather than the signal 
domain, so as to reduce a filtering calculation time. 

It is another objective of the present invention to provide a texture description 
method and a texture-based image retrieval method using a Gabor filter in a 
frequency domain for extracting enough texture information of an image in a low 
frequency domain of the image by raising the sampling density of the frequency, and 
extracting enough texture information in spite of the Gabor filtering having a wide 
pass band by lowering the sampling density of the frequency in a high frequency 
domain, by performing the Gabor filtering using a polar coordinate system. 

It is another objective of the present invention to provide a texture description 
method and a texture-based image retrieval method in a frequency domain for 
raising a retrieval rate using the average brightness values of an image, the 
variance of the brightness value of an image, energy, and a variance value of an 
energy for a feature descriptor of an image texture. 

it is another objective of the present invention to provide, in consideration of 
human visual features, a texture description method and a texture-based image 
retrieval method using a Gabor filter in a frequency domain, by designing a 
frequency pass band of a low frequency domain to be narrower and the frequency 
pass band of a high frequency domain to be wider as the frequency domain 
becomes higher. Here, it is taken into consideration that the human visual system 
sensitive to changes of the low frequency components of an image and insensitive 
to changes of the high frequency components, when designing a Gabor filter. 



It is another objective of the present invention to provide a texture description 
method and a texture-based image retrieval method using a Gabor filter in a 
frequency domain for easily retrieving an image changed by a minor amount by 
Gabor filtering so as not to change a value of an image texture descriptor if an 
image is rotated, magnified, or reduced by a small amount. 

To accomplish the above object of the present invention, there is provided a 
texture description method using a Gabor filter in a frequency domain including: 

a first step of transforming an image of a time domain into an image of a 
frequency domain; 

a second step of filtering the transformed frequency domain using a Gabor 
filter having NxM filter regions, where N and M are predetermined positive integers; 

a third step of extracting texture feature values of the Gabor filtered image in 
respective channels of a frequency domain division layout corresponding to the NxM 
filter regions of the Gabor filter; and 

a fourth step of determining a texture descriptor of the image on the basis of 
the texture feature values of the image. 

Preferably, in the first step, the image of the time domain is Fourier- 
transformed into an image of the orthogonal coordinate system frequency domain, 
or the image of the time domain is Radon-transformed, and then one dimensional 
Fourier-transformed into an image of a polar coordinate system frequency domain. 

Also, there is provided recording media which can be read by a computer in 
which is recorded a program for executing a texture description method using a 
Gabor filter in the frequency domain. 

Also, there is provided a texture-based image retrieval method using a Gabor 
filter in a frequency domain including: 

a first step of extracting a data texture descriptor by filtering data images 
using a Gabor filter and storing the extracted data texture descriptor in a data base; 

a second step of extracting and storing a query texture descriptor of a query 
image, i.e., a sought after image, when the query image is inputted using the Gabor 
filter; 
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a third step of matching the data texture descriptor and the query texture 
descriptor and measuring a distance between two texture descriptors; and 

a fourth step of determining a similarity between two images according to the 
distance between two texture descriptors. 

Preferably, in the first and second steps, the step of extracting the data 
texture descriptor and the query texture descriptor includes: 

a first sub-step of transforming an image of a visual domain into an image of 
a frequency domain; 

a second sub-step of filtering the image of the frequency domain using a 
Gabor filter having NxM filter regions, where N and M are predetermined positive 
integers; 

a third sub-step of extracting texture feature values of the Gabor-filtered 
image in respective channels of the frequency domain division layout corresponding 
to the NxM filter regions of the Gabor filter; and 

a fourth step of describing the image texture descriptor using the texture 
feature values of the image. 

Also, there is provided recording media which can be read by a computer in 
which is recorded a program for executing a texture-based image retrieval method 
using a Gabor filter in the frequency domain. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above objective(s) and advantages of the present invention will become 
more apparent by describing in detail a preferred embodiment thereof with reference 
to the attached drawings in which: 

FIG. 1 is a flowchart illustrating a texture description method in the frequency 
domain according to a preferred embodiment of the present invention; 

FIG. 2 illustrates a frequency domain division layout used for extracting 
texture descriptors of respective channels; and 

FIG. 3 illustrates a structure of a Gabor filter used for extracting texture 
descriptors of respective channels. 
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DETAILED DESCRIPTION OF THE INVENTION 



Hereinafter, "a texture description method and a texture-based image 
retrieval method using a Gabor filter in a frequency domain" according to a preferred 
embodiment of the present invention will be described in detail with reference to the 
attached drawings. 

FIG. 1 is a flowchart illustrating a texture description method in the frequency 
domain using a Gabor filter according to the present invention. 

The texture description method, a method for producing a texture descriptor 
by processing an input image is used for texture information-based indexing and 
texture information-based retrieving an image. That is, when images, which will be 
indexed in the database and stored, are input, data texture descriptors are produced 
according to the texture description method of FIG. 1, and the produced texture 
descriptors are stored in the database. Also, when a query image is inputted, 
texture descriptors of the query image are produced according to the texture 
description method of FIG. 1, and retrieval is performed by comparing the produced 
texture descriptors with the data images stored in the database. 

Referring to FIG. 1, the texture description method according to the present 
invention will be described in greater detail. 

First, a prescribed image is inputted (S11), and the input image is Fourier- 
transformed (S12) into an image of an orthogonal coordinate system or a polar 
coordinate system frequency domain. Here, the input image can be a data image or 
a query image as described above. The input image is two-dimensional Fourier- 
transformed into data of the orthogonal coordinate system frequency domain. 
Alternatively, the input image is Radon-transformed, and then the transformed data 
is one-dimensional Fourier-transformed into data of the polar coordinate system 
frequency domain. 

The process of transforming the input image into the polar coordinate system 
frequency domain is described as follows. First, the input image is Radon- 
transformed, wherein the Radon transform is a process of line-integrating a two- 
dimensional image or multi-dimensional multimedia data with respect to an angle to 
obtain one-dimensional projection data. That is, the shape of an object appears to 



change with respect to different viewing angles, and an object is seen from all 
angles, so that an outline of the object can be estimated. The Radon transform uses 
this principle. 

A Radon transform formula for transforming the two-dimensional image is 
expressed as Formula 1 . 

Here, f(x,y) is an image function of an orthogonal coordinate system time 
domain, p8(R) is a first projection function obtained by linear-integrating with respect 
to an axis having an angle of #with the positive x-axis, and passing through the 
origin of the orthogonal coordinate system, that is, a first Radon-transform function. 
5(x) is a function which becomes 1 when x is 0. A two-dimensional image has a 
region of -<~<x,y<«> in the orthogonal coordinate system, and regions of 0<s<«> and 
O<0<n\r\ the Radon coordinate system. When xcosO+ysinO=s, <5(xcos0+ysin0-s)=1 . 

In this way, a concurrence of the first Radon-transform function pg(R) 
obtained by rotating 8 from 0 degrees through 180-degree is called a signogram. 
The signogram is then Fourier-transformed so as to give a relation such as that 
shown in Formula 2 with a two-dimensional Fourier-transformed image function 
f(x,y) in the orthogonal coordinate system time domain. 

P(co,d)= F{co cose, co sinO) = F(o) x> a y j\ ax=a>CQ&6(0y=as ^ e -(2) 
Here, P((o,d) is a Fourier-transformed function of the Radon-transform 
function p^R). In addition, co is ^w x 2 + w y 2 , and 8 Is tan _1 (ft» / o) x ) . 

According to the Central Slice theory, the Fourier transform of the signogram 
is a one-dimensional function value obtained by cutting the Fourier-transformed 
function of a two-dimensional original image with respect to each <9axis. If this 
image is Radon-transformed and then Fourier-transformed, the image is 
transformed into an image of the polar coordinate system frequency domain. 

Next, in a step S13, filtering is performed using the frequency domain division 
layout as described in FIG. 2 in order to extract texture descriptors of respective 
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channels. In FIG. 3, the frequency domain division layout of FIG. 2 is illustrated in a 
more practical form. Gabor filters for filtering in the step S13 can be understood to 
be designed on the basis of the divided feature channels as illustrated in FIG. 3. 
That is, in the preferred embodiment, the Gabor filtering is performed using Gabor 
filters designed on the basis of a 5x6 feature channel spaces divided into 5 regions 
in the radial direction and 6 regions in the angular direction. Here, respective 
divided frequency domains shown in FIG. 2 correspond to the feature channels. 

The response features of the Gabor filter are expressed as Formula 3 in 
order to explain the operation of the Gabor filter designed as described above. 
Here, 

4 s , r is G Psr A sr exp 
a 

constant, G p (g),0) indicates the response features of a filter corresponding to the 

feature channels of the s-th radial direction and the r-th angular direction, s 
indicates a position in the radial direction as an integer number among {0,1,2,3,4}, 
and r indicates a position in the angular direction as an integer number among 
{0,1,2,3,4,5}. Also, a p 2 and o e 2 , respectively, are standard deviation values of 

feature channels of the coordinate p in the s direction and the coordinate 0 in the r 
direction, and correspond to the width of the feature channels of the radial direction 
and angular direction. 

In the Gabor filter having 5x6 filters as the preferred embodiment, exemplary 
standard deviations of the feature channels are tabulated in Tables 1 and 2. 
Variable values of the Gabor filter in the radial direction are tabulated in Table 1 , 
and variable values of the Gabor filter in the angular direction are tabulated in Table 
2. 



Table 1 



Radial index (s) 


0 


1 


2 


3 


4 


Centre frequency 


3 


3 


3 


3 


3 


(*>*) 


4 


8 


16 


32 


64 



-{co-co s f^ 








exp 


2^/ 
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Octavs 
bandwidth (B s ) 


2 


1 

4 


1 
8 


1 

16 


1 

32 




1 


1 


1 


1 


1 


V21n2 


V21n2 


^2 In 2 


3 -^21n2 


^2 In 2 



Table 2 



Angular index (r) 


0 


1 


2 


3 


4 


5 


Centre frequency 


0° 


30° 


60° 


90° 


120 c 


150° 


Angular 
bandwidth 


30° 


30° 


30° 


30° 


30° 


30° 


ff 0r 


30° 


30° 


30° 


30° 


30° 


30° 


V21n2 


V21n2 


V2ln2 


V21n2 


V2ln2 


V21n2 



Next, in a step S15, the texture features of images Gabor-filtered in the 
orthogonal coordinate system frequency domain or in the polar coordinate system 
frequency domain are extracted. Here, the orthogonal coordinate system frequency 
domain or the polar coordinate system frequency domain are divided on the basis of 
the human visual system (HVS) as illustrated in FIG. 2, and the divided frequency 
domains are each called feature channels. The feature channels are indicated as 
C„ where i=6*s+r+1, and C 0 indicates a DC feature channel. 

The polar coordinate system frequency domain division layout divides the 
frequency domain on the basis of the HVS. The -3 db pass band frequency 
features of the Gabor filter are designed to be disposed in a frequency domain to be 
suitable for the HVS. The frequency domain division method and design principle 
for the Gabor filter are similarly applied to the orthogonal coordinate system. That 
is, a feature of the HVS is that is sensitive to the low frequency components and 
insensitive to the high frequency components, and the frequency division layout is 
decided using these features. Hereinafter, this will be described in greater detail. 

In the present -invention, the energy mean value and energy variance value of 
the Gabor-filtered frequency domain are used as the texture features of the images. 

FIG. 2 illustrates a polar coordinate frequency domain division layout for 
extracting the mean of energy on the basis of the HVS. 



As shown in FIG. 2, the polar coordinate frequency domain is divided in the 
radial direction and the angular direction. The polar coordinate frequency domain is 
divided in the radial direction away from the origin at an octave interval, and in the 
angular direction 6 is divided into 180/P (here, P is a division resoiving power of 6). 
If the polar coordinate frequency domain is divided like this, in the polar coordinate 
frequency layout to extract the mean of energy, the low frequency domain is densely 
divided, and the high frequency domain is sparsely divided. The divided frequency 
domains are feature channels (Q, and the hatched portion is the feature channel 5. 

Here, important features of the present invention can be seen. Sampling of 
the low frequency domain is densely performed and sampling of the high frequency 
domain is sparsely performed by the Radon transform of the present invention, and 
when they are divided on the basis of the HVS, the low frequency domain is densely 
divided and the high frequency domain is sparsely divided. The respective divided 
frequency domains, that is, the feature values extracted from respective channels 
accurately reflect the texture features. 

When the energy mean value and energy variance value of the respective 
channels are obtained, image texture descriptors for describing an image texture 
from the feature values in the step S15, that is, the feature vectors, are calculated. 
A method for obtaining the energy mean value and energy variance value will be 
described later. 

The texture descriptors are expressed as Formula 4. 

F= [f STD J DC ,e(\\...,e(30),d{\),...,dOQ)} ...(4) 

Here, e(i) is the energy mean value of the i-th Gabor-filtered channel in the 
frequency layout of FIG. 2, and d(i) is the energy variance value of the i-th Gabor- 
filtered channel I in the frequency layout of FIG. 2. Here, in particular, f DC indicates 
an energy of the DC channel, and f STD indicates a variance of all of the pixel values 
of the image. The respective feature values of Formula 4 can be first described 
according to the order of priority of the channels, and the feature values of the 
channels having a low importance are excepted according to the importance of the 
channels, so that the amount of data can be reduced. Also, the texture descriptor 
feature vectors can be formed using only an energy of the respective channels as 



the feature values according to the importance of the feature or using both the 
energy and energy variance. 

The energy mean value (e(i)) and the energy variance value (d(i)) forming the 
described feature vectors are obtained by Formulas 6 and 8, and in doing so, a 
value p(i) is obtained using a Gabor-filtered function ( G p ^(a>,0)- P(a>,0) ) after 

the Fourier-transform in Formula 5. If p(i) is applied to Formula 6, the energy mean 
value (e(i)) can be obtained. Also, in Formula 7, a value q(i) is obtained using the 
Fourier-transformed first Radon transformed function and the p(i) value obtained in 
Formula 5. If the q(i) is applied to Formula 8, the energy variance value (d(i)) can be 
obtained. 

1 360° 

P(0 = I I [G p Ja,0)-P(cD,e)T ...(5) 

-a=0+ 9=0°+ 

e(i)= log[l +/>(/)] ...(6) 



| i iio° 

<K0 = JX Y.{\G Ptf (a>,eyp{a,,d)Y - pQ)Y -V) 

V <u=0+ 0=0°+ 

d(i)= log[l +?(/)] ...(8) 

In this way, the texture descriptors formed of the energy mean value and the 
energy variance value of the respective channels are obtained. 

The step S1 1 or S16 is performed repeatedly on all input images, and the 
respective data texture descriptors are stored in the database. 

The data texture descriptors stored in the database are matched with the 
query texture descriptors obtained from the query image and used for retrieving 
images similar to the query image. Hereinafter, the texture-based retrieval method 
according to a preferred embodiment of the present invention will be described. 

It is supposed that the database is indexed using the data texture descriptors 
(F). Next, the query texture descriptor (F q ) is extracted according to the texture 
description method of FIG. 1, which describes the query image (q), and a conformity 
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is measured by calculating a similarity between the texture descriptor (F d ) and the 
query texture descriptor (F q ) of arbitrary image data (d) within the database. 

This similarity is inversely proportional to the distance (D m ) between two 
texture descriptors obtained by Formula 9. 

d{q,d) = distance(F ? , J F c/ ) = ]T 

k 

Here, F={f(k),k=1 ,K), and co(k) indicate weight coefficients of each channel. 
In this way, the distance between the data texture descriptor and the query texture 
descriptor is obtained by comparing the mean with the variance by each channel. 
The result of Fourier-transforming the rotated image is the same as the result of 
rotating the image in the frequency domain during the Fourier-transform of the 
image before the rotation. Therefore, when comparing two images and retrieving, if 
comparing the images during the rotation in the frequency domain, two similar 
images having different rotation angles can be found. The rotation unchangeability 
is expressed as Formula 10. 

d(q,d,m0) = distance^ ...(10) 
Here, c|) = 30°. 

After the distance between two images is obtained by rotating the query 
image in the frequency domain and comparing the query image with the data image, 
the minimum value of the distance is used to indicate the amount of dissimilarity 
between the two final images. This is expressed as Formula 1 1 . 

d(q,d) = minimum of {d(q,d,m^)|m = 1 to 6} ...(11) 

Also, the result of Fourier-transform of the scaled image is the same as 
scaling, in the frequency domain, the result of Fourier-transformation of the original 
image. When comparing two images, if comparing the images during the scaling of 
the images in the frequency domain, two similar images having different sizes can 
be found. The scale invariance is expressed as Formula 12. 

d(q,d,n) = distance^ (£)| n ,F d (k)) ...(12) 
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Here, n is possible number of scale changes for the query image. According 
to the preferred embodiment of the present invention, a texture-based image 
retrieving method includes the steps of scaling the query image in predetermined 
scales in the frequency domain, extracting the texture descriptor for each scale, 
calculating the distance between those texture descriptors with the texture 
descriptor of data image, and taking the minimum distance as the distance between 
the two images. The step of taking the minimum distance can be expressed as 
Formula 13. 

d(q y d)= minimum of {d(q,d,n)|n = l to n}...(13) 
Here, N is actual number of scaled images. 

Though the present invention has been described on the basis of the 
preferred embodiments, the preferred embodiments are not intended to limit but to 
illustrate the present invention. It is obvious that various changes, amendments, or 
controls can be made by those skilled in the art without departing from the spirit and 
scope of the invention. Therefore, the present invention will be defined only by the 
appended claims, and the above examples of changes, modifications, or controls 
must be understood to be included in the present invention. 

According to the present invention, not only can the texture of an image be 
described more accurately, but also effective indexing and retrieval becomes 
possible using the Gabor-filtering method in the frequency domain, the polar 
coordinate system frequency domain division layout suitable for extracting 
respective feature values, the method for extracting feature values in respective 
frequency domains, and the techniques for granting the importance and the order of 
priority to respective frequency channels. 

The image texture descriptor extracted by the texture description method is 
very useful for image retrieval when searching for an image having a special feature 
in a huge aerial photograph and a radar image for military purpose. 
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